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Constraint-based flux balance analysis (FBA) has proven successful in predicting the flux distri- 
bution of metabolic networks in diverse environmental conditions. FBA finds one of the alternate 
optimal solutions that maximizes the biomass production rate. Almaas et al have shown that the 
flux distribution follows a power law, and it is possible to associate with most metabolites two 
reactions which maximally produce and consume a give metabolite, respectively. This observation 
led to the concept of high-flux backbone (HFB) in metabolic networks. In previous work, the HFB 
has been computed using a particular optima obtained using FBA. In this paper, we investigate the 
conservation of HFB of a particular solution for a given medium across different alternate optima 
and near-optima in metabolic networks of E. coli and S. cerevisiae. Using flux variability analysis 
(FVA), we propose a method to determine reactions that are guaranteed to be in HFB regardless 
of alternate solutions. We find that the HFB of a particular optima is largely conserved across 
alternate optima in E. coli, while it is only moderately conserved in S. cerevisiae. However, the 
HFB of a particular near-optima shows a large variation across alternate near-optima in both or- 
ganisms. We show that the conserved set of reactions in HFB across alternate near-optima has a 
large overlap with essential reactions and reactions which are both uniquely consuming (UC) and 
uniquely producing (UP). Our findings suggest that the structure of the metabolic network admits a 
high degree of redundancy and plasticity in near-optimal flow patterns enhancing system robustness 
for a given environmental condition. 



Introduction 



The study of the structure and dynamics of complex 
biological networks is important for understanding the 
system- level behaviour of living organisms 0, E| ■ The 
metabolic network is one such physicochemical system 
inside the cell which can be viewed as a complex network 
consisting of metabolites involved in enzyme catalyzed 
reactions. Recent advances in the development of high- 
throughput data collection techniques coupled with sys- 
tematic analysis of fully sequenced genomes has led to the 
reconstruction of organism-specific metabolic networks 
'9;J . Although the list of metabolic reactions along with 
the involved stoichiometries is largely known for many 
organisms, we currently lack the knowledge of most ki- 
netic rate constants and enzyme concentrations in these 
networks. In the absence of detailed knowledge of vari- 
ous kinetic parameters, constraint-based modelling tech- 
niques like Flux balance analysis (FBA), which utilizes 
primarily structural information of the network in the 
form of stoichiometry of various reactions, have proven 
useful in predicting the quantitative behaviour of the 
cellular metabolism under different environmental con- 
ditions d HSU Si- 

FBA is computational technique which can be used 
to obtain a particular flux distribution for a given en- 
vironmental condition that maximizes certain objective 
function which is usually taken to be the biomass produc- 
tion rate (see methods section for details). However, for 
large scale metabolic networks, it has been shown that 
there are a large number of different flux distributions 
or alternate solutions for a given environmental condi- 



tion with the exact same value for the objective function 

[3 0, E3- Thus > FBA finds one of man y possible al- 
ternate optimal solutions for a given environmental con- 
dition. Mixed integer linear programming (MILP) has 
been used to enumerate different alternate optimal flux 
distributions for a given environmental condition [l5l.[2l|. 
However, for large scale metabolic networks, the number 
of alternate optimal solutions has been shown to be ex- 
tremely large, and it is computationally infeasiblc using 
MILP to enumerate all possible alternate optima 21 1. 
Although, it is not feasible to enumerate all alternate 
optima, flux variability analysis (FVA) [n| can be used 
to determine the minimum and maximum flux value of 
each reaction across the set of alternate optimal solu- 
tions for a given environmental condition. Specifically, 
using FVA, we can determine reactions that can have a 
nonzero flux in some alternate optima and the reactions 
whose flux vary across different alternate optima for a 
given environmental condition. 



In the FBA approach, we obtain a particular solution 
by imposing the optimality criterion wherein a flux dis- 
tribution satisfying the governing constraints along with 
maximal biomass production rate is predicted as the phe- 
notype for a given environmental condition. However, 
the optimal flux distribution with maximal biomass pro- 
duction may not always explain the observed behaviour 
for a given environmental condition 1(| 12, 24l [2rl W\. 
Further, wet lab experiments have shown that the organ- 
ism may initially grow suboptimally under certain media, 
i.e., the observed growth rate is less than the maximum 
value predicted by FBA, and then evolve to grow at the 
predicted optimal rate after several generations 1(| T3 |. 
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It is plausible that there are multiple objectives that the 
cell is trying to optimize to different extent under dif- 
ferent environments. Thus, the metabolic network may 
operate under suboptimal growth conditions under cer- 
tain environmental conditions. Hence, it is important to 
study the suboptimal flux distributions, especially, near- 
optimal flux distributions in metabolic networks for dif- 
ferent environmental conditions. 

Using the optimal solution predicted by FBA, Almaas 
et al [2| have shown that the distribution of reaction 
fluxes for a given environmental condition follows a power 
law in the E. coli metabolic network. The observed power 
law distribution implies an inhomogeneity in the reaction 
network usage with most reactions having small fluxes 
and a few reactions having high fluxes. Almaas et al 
showed that the heterogeneity of overall flux distribution 
is also valid to a large extent at the level of individual 
metabolites. Further, for most metabolites, they were 
able to identity two reactions that dominate the produc- 
tion and consumption of a metabolite, respectively. Us- 
ing this property at the level of an individual metabolite, 
Almaas et al 0] have proposed a simple algorithm to con- 
struct a subnetwork referred to as the "high-flux back- 
bone" (HFB) which contains the locally maximal flow 
paths for a given environmental condition in the net- 
work . The HFB for a given environmental condition 
was shown to form a giant connected cluster with large 
overlap with classical pathways described in biochemical 
literature. It has also been suggested that the HFB can 
be used as a network reduction algorithm. 

In this paper, we have studied the activity and flux 
variability of different reactions across alternate flux dis- 
tributions for a given medium under optimal and sub- 
optimal growth conditions in metabolic networks of E. 
coli and S. cerevisiae. Almaas et al have used a sin- 
gle solution obtained using FBA from the set of different 
alternate optima to compute the HFB of the metabolic 
network for a given medium. In this paper, we have in- 
vestigated in detail the effect of alternate optima on the 
set of reactions in HFB computed using a single solution 
for a given medium in E. coli and S. cerevisiae. We have 
further investigated the effect of alternate suboptimal so- 
lutions corresponding to near-optimal growth condition 
on the set of reactions in HFB computed using a sin- 
gle suboptimal solution for a given medium in the two 
organisms. 

Results and Discussion 

In this section, we present results from our study of 
the two metabolic networks: E. coli (version UR904 
[22j ]) and S. cerevisiae (version iND750 0]). The two 
networks can be downloaded from the BiGG database: 
:/ '/bigg. ucsd.edu/ '. The E. coli metabolic network 
UR904 contains 761 metabolites and 931 reactions and 



the S. cerevisiae metabolic network iND750 contains 
1061 metabolites and 1149 reactions. For each organ- 
ism, the two databases additionally provide a biomass 
production reaction which gives the relative contribution 
of key metabolites towards cellular biomass. We have 
investigated here the two metabolic networks under dif- 
ferent environmental conditions corresponding to 89 and 
43 aerobic minimal media for E. coli and S. cerevisiae, 
respectively. Using FBA, it has been predicted elsewhere 
that this set of 89 and 43 aerobic minimal media for E. 
coli and S. cerevisiae, respectively, considered here are 
the only aerobic minimal media that can support nonzero 
growth in the two networks [23| . 

Activity and flux variability of reactions under 
optimal and near-optimal conditions 

We can determine the maximum and minimum flux 
through each reaction that supports optimal growth in 
a minimal medium using FVA (see methods section for 
details). A reaction is considered to be possibly active in 
a medium across alternate optima, if the maximum flux 
value that supports optimal growth is nonzero. A reac- 
tion is considered to have a variable flux in a medium 
across alternate optima, if the difference between max- 
imum and minimum flux value that supports optimal 
growth is nonzero. 

Using FVA, we determined the number of reactions 
that can be possibly active and the reactions with vari- 
able flux across alternate optima for each of the 89 dif- 
ferent minimal media in the E. coli metabolic network. 
The number of reactions that can be possibly active in 
a medium varies between 278 and 332 reactions across 
the 89 minimal media. The number of reactions with 
variable flux in a medium varies between 34 and 83 reac- 
tions across the 89 minimal media. These results for the 
E. coli metabolic network are shown in Fig. [lja). We 
find that 11 — 25% of the reactions that can be possibly 
active in a given medium to have variable flux across al- 
ternate optima for that medium. The results obtained 
here for the E. coli metabolic network using FVA for 89 
different aerobic minimal media are consistent with the 
results obtained earlier by Reed and Palsson [5l| using 
a larger set of alternate optima for 136 different aerobic 
and anaerobic minimal media. 

We also determined the number of reactions that can 
be possibly active and the reactions with variable flux 
across alternate optima for each of the 43 different mini- 
mal media in the S. cerevisiae metabolic network. These 
results for the S. cerevisiae metabolic network are shown 
in Fig. Hlja). We find that 22 - 69% of the reactions 
that can be possibly active in a given medium to have 
variable flux across alternate optima for that medium 
in S. cerevisiae. For one of the 43 minimal media, i.e., 
arginine aerobic minimal medium, we find a much larger 
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FIG. 1: Activity and flux variability of reactions under (a) optimal growth, (b) near-optimal growth (> 99% of the optimal 
rate) and (c) suboptimal growth (> 5% of the optimal rate) for 89 different minimal media in the E. coli metabolic network. 
Each bar corresponds to one of the 89 different minimal media. The height of the bar gives the number of reactions that can 
be possibly active and the height upto which the bar is filled with grey colour gives the number of reactions whose flux vary 
across alternate solutions for that medium. 



number of possibly active reactions and reactions with 
variable flux across alternate optima compared to other 
minimal media (cf. Fig. [2ta)). A closer look at the 
data for arginine minimal medium showed that many 
transport reactions involved in excretion of ammonia, 
guanine, urea and xanthine have a variable flux across 
alternate optima. This observation explains the larger 
flexibility under arginine minimal medium within the S. 
cerevisiae metabolic network to achieve optimal growth 
rate compared to other media. Even if we exclude the 
results for the arginine minimal medium, we find that 
the flux variability for a typical minimal medium in S. 
cerevisiae is greater than the flux variability for a typical 
minimal medium in E. coli under optimal growth con- 
dition. This finding suggests a more evolved structure 
of the metabolic network inside the eukaryote S. cere- 
visiae with greater redundancy compared to that of the 
prokaryote E. coli. 

We investigated both metabolic networks under two 
suboptimal scenarios: (a) where the growth rate is > 99% 
of the optimal growth rate and (b) where the growth rate 



is > 5% of the optimal growth rate. The first of the two 
suboptimal scenarios will be referred to as 'near-optimal 
condition' as we account for all flux distributions that al- 
low growth at > 99% of the optimal rate. In the second 
suboptimal scenario, we account for all flux distributions 
which allow growth at > 5% of the optimal rate. We 
used FVA to determine the number of possibly active re- 
actions and the reactions with variable flux in a medium 
across alternate suboptimal solutions for different min- 
imal media. From here onwards, we refer to the set of 
alternate optimal flux distributions for a given minimal 
medium in short as 'alternate optima'. For the first sub- 
optimal scenario, we refer to the set of alternate subop- 
timal flux distributions which have near-optimal growth 
rate for a given minimal medium in short as 'alternate 
near-optima'. For the second suboptimal scenario, we re- 
fer to the set of alternate suboptimal flux distributions 
which have growth rate > 5% of the optimal rate for a 
given minimal medium in short as 'alternate suboptima'. 

Using FVA, we determined the number of reactions 
that can be possibly active and the reactions with vari- 
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Different media 

FIG. 2: Activity and flux variability of reactions under (a) optimal growth, (b) near-optimal growth (> 99% of the optimal 
rate) and (c) suboptimal growth (> 5% of the optimal rate) for 43 different minimal media in the S. cerevisiae metabolic 
network. Each bar corresponds to one of the 43 different minimal media. The height of the bar gives the number of reactions 
that can be possibly active and the height upto which the bar is filled with grey colour gives the number of reactions whose 
flux vary across alternate solutions for that medium. 



able flux in the two suboptimal scenarios for each of the 
89 different minimal media in the E. coli metabolic net- 
work. The number of reactions that can be possibly ac- 
tive in a given medium is obtained to be close to 600 in 
both suboptimal scenarios. Almost all reactions which 
can be possibly active have variable flux across alternate 
near-optima in the first suboptimal scenario. Further, all 
reactions which can be possibly active have variable flux 
across alternate suboptima in the second suboptimal sce- 
nario. These results for the E. coli metabolic network are 
shown in Fig. QJb) and (c). We find that most reactions 
which can be possibly active in a given medium can have 
a variable flux across alternate solutions in near-optimal 
growth condition (cf. Fig. O^b)). We found similar re- 
sults for both suboptimal conditions in the S. cerevisiae 
metabolic network (cf. Fig. [DJb) and (c)). We empha- 
size that although the set of reactions with variable flux 
under both suboptimal scenarios is almost the same, the 
amount of flux variability of a reaction is larger in the 
second suboptimal scenario compared to the first near- 
optimal scenario. 



By comparing the number of possibly active reactions 
in each minimal medium investigated in E. coli (cf. Fig. 
QJa) and (b)) and S. cerevisiae (cf. Fig. Efa) and (b)) un- 
der optimal and near-optimal conditions, we can clearly 
see that the number is significantly more in the near- 
optimal condition. Thus, a larger set of reactions can 
be recruited under near-optimal condition compared to 
strictly optimal condition. This suggests a high degree 
of flux plasticity in the metabolic network under near- 
optimal condition. In a recent paper, Nishikawa et al 



17[ have shown that the maximization of a linear objec- 



tive function, such as growth rate or ATP production, in 
FBA to obtain a particular flux distribution leads to sig- 
nificantly lower number of active reactions under optimal 
condition compared to typical non-optimal states. They 
attribute this massive spontaneous reaction silencing to 
the irreversibility of reactions and cascade of inactivity 
propagated by irreversible reactions in the network. Our 
results obtained using FVA add to the results obtained 
by Nishikawa et al [12J. We have shown here that, even 
in near-optimal growth conditions, the number of possi- 
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bly active reactions and the reactions with variable flux 
across alternate near-optima is much larger compared to 
strictly optimal scenario. 

We end this subsection by mentioning an important 
methodological detail. While implementing FVA, we 
need to account for reactions that are involved in futile 
internal cycles. An internal cycle or Type III extreme 
pathway is a set of internal reactions that can satisfy the 
steady state mass balance condition without involvement 
of any exchange flux 

UBS. Such cycles are thermody- 
namically infeasible and need to be eliminated in any 
steady state analysis. We determined all internal cycles 
in the two metabolic networks studied here, and based 
on that, decided to constrain the flux of 8 reactions in 
E. coli and 16 reactions in S. cerevisiae to zero to avoid 
futile cycles in our simulations (sec methods section for 
details). 



High-flux backbone under optimal and near-optimal 
conditions 



In the previous subsection, we have shown that there 
is variation in the activity as well as flux value of reac- 
tions across the alternate optima for a given medium in 
both E. coli and S. cerevisiae. Further, we have shown 
that the number of possibly active reactions and reactions 
with variable flux across alternate near-optima is signifi- 
cantly higher in near-optimal condition compared to the 
strictly optimal case. In this subsection, we investigate 
the effect of alternate flux distributions on the set of re- 
actions in the high-flux backbone (HFB) obtained using 
a particular solution in both optimal and near-optimal 
conditions. 

Using mixed integer linear programming (MILP) (lBI . 
211 ] . we obtained 100 alternate optima and 100 alternate 
near-optima for each of the 89 aerobic minimal media in 
E. coli and 43 aerobic minimal media in S. cerevisiae (see 
methods section for details). For each minimal medium 
investigated, we obtained the 100 HFB corresponding to 
100 alternate optima. We then calculated the average 
number of reactions in HFB across the alternate optima 
for each medium. These results for the 89 minimal media 
in E. coli and 43 minimal media in S. cerevisiae under 
optimal growth conditions are shown in Fig. [3](a) and 
Fig. HJa), respectively. Similarly, we obtained the av- 
erage number of reactions in HFB across the alternate 
near-optima for each medium. These results are shown 
for E. coli and S. cerevisiae in Fig. [3](b) and Fig. E^b), 
respectively. 

In case of both E. coli and S. cerevisiae, for a given 
medium, we found the set of reactions in HFB corre- 
sponding to different alternate optima to differ from each 
other. This was also true for the the set of reactions in the 
HFB corresponding to different alternate near-optima. 
However, the number of alternate optima or alternate 



near-optima sampled here is too small, and the data can- 
not be used to determine the overlap among the reactions 
in HFB corresponding to different alternate solutions for 
a given medium. Thus, we devise a simple method using 
FVA to determine the set of reactions which are guaran- 
teed to be in any HFB corresponding to different alter- 
nate solutions for a given medium (see methods section 
for details). We used this method to determine the set of 
reactions that are guaranteed to be in HFB correspond- 
ing to any alternate optima for a given medium in the 
two networks. We also obtained the set of reactions that 
are guaranteed to be in HFB corresponding to different 
alternate near-optima for a given medium in the two net- 
works. These results are shown for the 89 minimal media 
in E. coli in Fig. [3ja) and (b) and for the 43 minimal 
media in S. cerevisiae in Fig. 0Ja) and (b). 

We find that at least 3 — 13% of reactions in HFB cor- 
responding to a particular optima for a given medium 
are not guaranteed to in HFB corresponding to some al- 
ternate optima for that medium in the E. coli metabolic 
network (cf. Fig. G^a)). However, at least 34 - 42% 
of reactions in HFB corresponding to a particular near- 
optima for a given medium are not guaranteed to be in 
HFB corresponding to some alternate near-optima for 
that medium in E. coli (cf. Fig. EJb)). Similarly, we 
find that at least 11 — 45% of reactions in HFB cor- 
responding to a particular optima for a given medium 
are not guaranteed to be in HFB corresponding to some 
alternate optima for that medium in the S. cerevisiae 
metabolic network (cf. Fig. Ufa)). Even if we discount 
the arginine minimal medium, which has an uncharac- 
teristically larger flux variability in S. cerevisiae under 
optimal conditions (cf. Fig. [2]^a)), still at least 11 — 20% 
of reactions in HFB corresponding to a particular op- 
tima for a given medium are not guaranteed to be in 
HFB corresponding to some alternate optima for that 
medium. Further, at least 36 — 47% of reactions in HFB 
corresponding to a particular near-optima are not guar- 
anteed to be in HFB corresponding to some alternate 
near-optima for that medium in S. cerevisiae (cf. Fig. 
lib))- 

Note that we have used the simplex method to solve 
the LP problem in order to sample different alternate op- 
tima and near-optima. Simplex method tries to return 
a particular solution or flux distribution from the set of 
alternate solutions that has least number of nonzero re- 
action fluxes in it [l[. Thus, our results obtained using 
MILP regarding the average number of reactions in HFB 
corresponding to a particular solution computed using 
100 alternate solutions for a given medium may change 
with exhaustive sampling. However, our results obtained 
using FVA regarding the set of reactions that are guaran- 
teed to be in HFB corresponding to any alternate optima 
or suboptima for a given medium is independent of sam- 
pling. 

We have shown here that the set of reactions in HFB 
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FIG. 3: Histogram of the average number of reactions in HFB under (a) optimal growth and (b) near-optimal growth for 89 
different minimal media in the E. coli metabolic network. Each bar corresponds to one of the 89 different minimal media. The 
height of the bar gives the average number of reactions in the HFB computed using 100 alternate solutions obtained using 
MILP for each minimal medium. The height upto which the bar is filled with grey colour gives the number of reactions which 
are guaranteed to be in HFB of all alternate solutions for that medium. 




Different media 



FIG. 4: Histogram of the average number of reactions in HFB under (a) optimal growth and (b) near-optimal growth for 43 
different minimal media in the S. cerevisiae metabolic network. Each bar corresponds to one of the 43 different minimal media. 
The height of the bar gives the average number of reactions in the HFB computed using 100 alternate solutions obtained using 
MILP for each minimal medium. The height upto which the bar is filled with grey colour gives the number of reactions which 
are guaranteed to be in HFB of all alternate solutions for that medium. 
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depends upon the particular solution chosen from the set 
of alternate optima to compute it in both E. coli and 
S. cerevisiae. Under optimal condition, only 3 — 11% 
of the reactions in HFB of a particular solution are not 
guaranteed to be in HFB of some other alternate op- 
tima in E. coli which is relatively small. But the corre- 
sponding fraction in 5*. cerevisiae for optimal condition 
is much higher. Also, we have shown for near-optimal 
condition, greater than 34% of the reactions in HFB of a 
particular near-optima are not guaranteed to be in HFB 
of some other alternate near-optima in both organisms. 
Thus, the set of reactions in HFB significantly varies un- 
der near-optimal conditions across alternate near-optima 
for a given medium in both organisms. These findings 
suggest that the flow patterns in metabolic networks for 
a given medium in near-optimal conditions is much more 
complex and degenerate compared to that under strictly 
optimal conditions. 

Our findings also imply that even for a given medium 
the structure of the metabolic network has evolved so 
as to allow both structural and flux plasticity @ in the 
level of reaction usage to achieve the same optimal or 
near-optimal growth objective through multiple flux dis- 
tributions. Structural plasticity associated with a given 
medium can result in alternate equally efficient pathways 
or metabolic routes to be recruited in different alternate 
solutions. Flux plasticity associated with a given medium 
can result in different value of reaction fluxes in different 
alternate solutions. Both structural and flux plasticity 
associated with alternate flux distributions for a given 
medium renders the set of reactions in HFB of a particu- 
lar solution for that medium variable across alternate so- 
lutions. However, we still find that, under near-optimal 
condition, at least 53% of the reactions in HFB of a par- 
ticular solution are conserved across HFB of any alter- 
nate near-optima for a given medium in both organisms. 
In contrast, we have shown in the last subsection for near- 
optimal condition that, almost all reactions that can be 
possibly active for a given minimal medium have a vari- 
able flux across alternate near-optima for that medium. 

We now shed light on why at least 53% of the reactions 
in HFB of a particular solution are conserved across HFB 
of any alternate near-optima for a given medium when 
almost all reactions that can be possibly active for that 
medium show variation in their flux value. Since, the set 
of conserved reactions across HFB of any alternate near- 
optima are always active for a given medium, it is natural 
to expect this set of reactions to have a high overlap with 
essential reactions for growth in that medium. We find 
that on average 90% of reactions guaranteed to be in 
HFB of alternate near-optima are essential for a given 
medium in E. coli while the corresponding fraction in S. 
cerevisiae is 80% (cf. Fig. [5]). Thus, the conserved part 
of HFB across alternate solutions for a given medium has 
high density of essential reactions. 

It has been shown elsewhere that almost all essential 



reactions in the metabolic network get explained by their 
association with low degree uniquely produced (UP) or 

~1- Notice that 



If 



uniquely consumed (UC) metabolite 
if a given reaction consumes a UC metabolite and pro- 
duces a UP metabolite, and the reaction has a nonzero 
flux value across all alternate near-optima for a given 
medium, then it is guaranteed to be in HFB of any alter- 
nate near-optima. We find that on average 80% of reac- 
tions guaranteed to be in HFB of alternate near-optima 
consume a UC metabolite and produce a UP metabolite 
for a given medium in E. coli while the corresponding 
fraction for S. cerevisiae is 77% (cf. Fig. [6]). Thus, the 
conserved part of HFB across alternate solutions for a 
given medium has high density of reactions that satisfy 
both UP and UC property. In determining UP and UC 
metabolites for a given medium, we removed the blocked 
reactions @, [25} for that medium as outlined in Samal 
et al H3. 

The above mentioned results show that most reactions 
that are conserved in HFB regardless of alternate near- 
optima for a given medium get explained by the absence 
of multiple paths to carry out certain processes in the 
metabolic network. Our findings also suggests that, in 
most cases, if a metabolite has multiple reactions pro- 
ducing it and multiple reactions consuming it, we can- 
not associate the metabolite with a single reaction that 
produces it maximally across all alternate near-optima 
and a single reaction that consumes it maximally across 
all alternate near-optima. These results show that, un- 
der near-optimal conditions, the metabolic network of E. 
coli and S. cerevisiae exhibit a high degree of flux plas- 
ticity for a given minimal medium contributing towards 
the robustness of the system. 



Conclusions 

In this paper, we have investigated the effect of alter- 
nate optima and near-optima for a given medium on the 
set of reactions in the high- flux backbone (HFB) com- 
puted using a particular solution for that medium in the 
E. coli and S. cerevisiae metabolic network. Using flux 
variability analysis (FVA), we have shown here that the 
number of reactions that can be possibly active and re- 
actions with variable flux across alternate solutions for a 
given medium under strictly optimal condition is signif- 
icantly less than that in near-optimal condition for both 
metabolic networks. Since, the set of reactions in the 
HFB of a particular solution may vary across different 
alternate solutions for a given medium, we devised a sim- 
ple method using FVA to determine the set of reactions 
which are guaranteed to be in HFB of any alternate solu- 
tion for a given medium. We find that the set of reactions 
in HFB of particular optima for a given medium shows 
small variation across alternate optima for that medium 
in E. coli and moderate variation in S. cerevisiae. How- 
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FIG. 5: Histogram of the fraction of reactions guaranteed to be in HFB of any alternate near-optima that are essential for (a) 
89 minimal media in E. coli and (b) 43 minimal media in S. cerevisiae. 
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FIG. 6: Histogram of the fraction of reactions guaranteed to be in HFB of any alternate near-optima that satisfy both UP and 
UC property for (a) 89 minimal media in E. coli and (b) 43 minimal media in S. cerevisiae. 
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ever, this variation is significantly large in both organ- 
isms when we consider the alternate near-optima for a 
given medium. We show that the majority of reactions 
that are guaranteed to be in HFB of any alternate near- 
optima for a given medium are also essential for growth in 
that medium. We also show that majority of reactions in 
the conserved HFB set across alternate near-optima for a 
given medium also satisfy both the uniquely consuming 
(UC) and uniquely producing (UP) property. Our results 
show that the structure of the metabolic network allows 
a high degree of structural and flux plasticity within a 
given environmental condition under near-optimal con- 
ditions which enhances the robustness of metabolic net- 
works in both organisms. 



most commonly used objective function is the maximiza- 
tion of biomass production rate. The LP formulation of 
the FBA approach is as follows: 

Maximize Z = c T v 
such that S.v = 

a < Vi < (2) 

where vector c corresponds to the objective function, a 
and f3 are the lower and upper limits of individual flux 
values. 

Mixed integer linear programming 



Methods 
Flux balance analysis 

Flux balance analysis (FBA) is a computational mod- 
elling technique which can be used to obtain a prediction 
for the fluxes of all reactions in the metabolic network 
and growth rate of the organism [FJ, |2CJ, |28| . The main 
assumptions made in the technique of FBA are: (a) the 
network reaches a steady state for any given environmen- 
tal condition wherein the concentration of all internal 
metabolites and velocities of all reactions are constant, 
and (b) the cell tries to adjust its intracellular machin- 
ery or reaction fluxes so as to maximize its growth rate or 
biomass production. The key requirement for implement- 
ing FBA is the information about the stoichiometry of all 
metabolic reactions known to occur in an organism. The 
complete list of reactions along with their stoichiometry 
is encapsulated in the stoichiometric matrix S of dimen- 
sions m x n where m denotes the number of metabolites 
and n denotes the number of reactions. In any metabolic 
steady state, each metabolite achieves a dynamic mass 
balance and the vector v = (vi,V2,- ■ ■ ,v n ) T representing 
the fluxes through each reaction must satisfy the equa- 
tion 



S.v = 0, 



(1) 



so as to not violate mass conservation. Eq. [T] gives us 
a system of linear equations relating various fluxes in 
the network. In addition to linear stoichiometric con- 
straints, there exist thermodynamic constraints that may 
render certain reactions irreversible and flux capacity 
constraints which impose limitations on maximum flux 
through certain reactions. These additional constraints 
can be used to further limit the space of allowable fluxes 
given by Eq. [T] To obtain a particular solution from 
the space of allowable fluxes, linear programming (LP) is 
used to find a flux distribution in the allowable space that 
gives the optimal value for an objective function. The 



For most large scale metabolic networks, FBA finds 
one of many possible alternate optimal flux distributions 
that have the same objective value for a given environ- 
mental condition [l5|, [ly, Hl[ . We can use mixed integer 
linear programming (MILP) to enumerate different alter- 
nate optima. We have used here the recursive MILP al- 
gorithm proposed by Reed and Palsson 2l| to enumerate 
different alternate optima. In this method, in addition to 
constraints in the LP formulation of FBA given by Eq. 
[5J the following new constraints are imposed to obtain 
different solutions: 

E y^ 1 

Wi > \NZ k \ - 1, k= 1... J - 1 

i£NZ J 

y% + Wi < l, Wi 

a ■ u>i < Vj < (3 • Wi Vi (3) 

In this method each reaction i in the network is associ- 
ated with two binary variables yi and Wi. NZ J represents 
the set of reactions with nonzero flux value in the solu- 
tion at iteration J. Given the constraints in Eq. [3J if 
the binary variable yi = l then Wi=0, which forces both 
upper and lower bounds for reaction i to be zero. Thus, 
setting yi=l forces the reaction i to be removed from the 
basis of the next iteration. In this recursive algorithm, 
at each iteration J, at least one of the nonzero fluxes of 
the previous solution (NZ J ~ r ) is removed from the ba- 
sis of the current iteration. The MILP problem given by 
Eq. [2] and [3] is then solved again to generate a different 
alternate solution. 



Flux variability analysis 

The number of alternate optima in large scale 
metabolic networks is extremely large, and it can be com- 
putationally infeasible to enumerate all alternate optima. 
Flux variability analysis (FVA) [l6| is an alternative tech- 
nique which can be used to determine the flux variability 
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associated with each reaction in the complete set of alter- 
nate optima for a given medium. Initially, the maximum 
value of the objective function, i.e., Z f,j, is determined. 
This is followed by solving two LP problems for each re- 
action i in the network. The LP formulation of the FVA 
approach is as follows: 

Case 1: 

Maximize Vi 
such that S.v = 

c T v = Z obj 

a < Vi < (3 for i = 1 . . . n (4) 

Case 2: 

Minimize Vi 
such that S.v = 

c T v = Z bj 

oi < Vi < (3 for i = 1 ... n (5) 

In the first case, the maximum flux through each reaction 
that supports optimal growth rate Z bj is determined. In 
the second case, the minimum flux through each reaction 
that supports optimal growth rate Z a bj is determined. 
The difference of maximum and minimum value that sup- 
ports optimal growth rate for each reaction gives the flux 
variability of that reaction. A slight modification of the 
LP problem given by Eq. [4] and [5] is able to generate the 
flux variability under suboptimal conditions. The subop- 
timal condition corresponding to near-optimality studied 
here requires the following change in Eq. [4] and [5j Re- 
place the condition c T v=Z bj with c T v>0.99Z o (,j. 

Reactions involved in futile cycles 

We used a recently published algorithm by Wright and 
Wagner [2^] to compute the reactions in E. coli and S. 
cerevisiae metabolic network that are involved in inter- 
nal cycles 0, [lj| of size > 3. The number of reactions 
involved in internal cycles of size > 3 in E. coli and S. 
cerevisiae was obtained to be 22 and 60, respectively. Us- 
ing METATOOL version 5.0 [HI, we obtained a total of 
9 and 28 internal cycles of size > 3 in the metabolic net- 
work of E. coli and S. cerevisiae, respectively. In order 
to avoid these internal cycles, we followed the standard 
procedure of constraining at least one reaction flux in 
each internal cycle to zero. By examining the different 
internal cycles of size > 3, we decided to constrain the 
flux of following reactions: ABUTt2, ACCOAL, ADK1, 
GLUT4, PROt4, SERt4, THRt4 and VPAMT in the E. coli 
metabolic network UR904 and the flux of follow- 
ing reactions: ASPTAm, ACONTm, DCMPDA, FRDcm, 
GHMT2r, GLUT5m, MALt2r, NDPK1, NDPK8, NDPK9, 
PHCDm, SHSL1, SUCCtm, SUCCFUMtm, SUCD2_u6m and 



UGLT in the S. cerevisiae metabolic network iND750 Q 
to zero to avoid futile cycles. The abbreviations of the 
reactions in the two metabolic networks mentioned above 
are same as in the databases UR904 and iND750, respec- 
tively. 

High-flux backbone 

The method to determine high-flux backbone (HFB) 
for a given medium using a particular solution is as fol- 
lows [2j: 

(a) For each metabolite in the network, remove all 
reactions except the reaction that produces the 
metabolite with largest flux and the reaction that 
consumes the metabolite with largest flux. The 
metabolites that are either not produced (con- 
sumed) are not taken into account. 

(b) If metabolite A is an educt and metabolite B is 
a product of reaction R, and further R maximally 
consumes A and maximally produces B, then reac- 
tion R is part of the HFB. 

Since, the set of reactions in HFB of a particular solution 
may change if it was obtained from a different alternate 
solution, we here propose a simple method using FVA 
to determine the set of reactions that are guaranteed to 
be in HFB of all alternate solutions for a given medium. 
Our method is as follows: 

(a) Using FVA, we obtain the minimum and maximum 
flux of each reaction R in the network that can sup- 
port optimal growth for a given minimal medium. 

(b) We designate reaction R as producing (consuming) 
metabolite A with largest flux across all alternate 
optima, if the minimum flux for R obtained using 
FVA is greater than the maximum flux for each 
reaction other than R that produces (consumes) 
A. 

(c) For each metabolite in the network, remove all reac- 
tions except the reaction that produces the metabo- 
lite with largest flux and the reaction that con- 
sumes the metabolite with largest flux across all 
alternate optima. 

(d) If metabolite A is an educt and metabolite B is 
a product of reaction R, and further R maximally 
consumes A and maximally produces B across all 
alternate optima, then reaction R is contained in 
HFB of all alternate optima. 

Our method described here for alternate optimal solu- 
tions can also be used to determine the set of reactions 
that are guaranteed to be in HFB of all alternate near- 
optimal solutions. 
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